Picture for Wei Ji

Wei Ji

Turing Patterns for Multimedia: Reaction-Diffusion Multi-Modal Fusion for Language-Guided Video Moment Retrieval

Add code
Jun 01, 2026
Viaarxiv icon

Explainable Forensics of Manipulated Segments in Untrimmed Long Videos

Add code
Jun 01, 2026
Viaarxiv icon

GIRL-DETR: Gradient-Isolated Reinforcement Learning for Video Moment Retrieval

Add code
May 30, 2026
Viaarxiv icon

Immuno-VLM: Immunizing Large Vision-Language Models via Generative Semantic Antibodies for Open-World Trustworthiness

Add code
May 29, 2026
Viaarxiv icon

Towards Unified Vision-Language Models with Incomplete Multi-Modal Inputs

Add code
May 27, 2026
Viaarxiv icon

ConceptSeg-R1: Segment Any Concept via Meta-Reinforcement Learning

Add code
May 19, 2026
Viaarxiv icon

Towards Unified Surgical Scene Understanding:Bridging Reasoning and Grounding via MLLMs

Add code
May 13, 2026
Viaarxiv icon

RADAR: Redundancy-Aware Diffusion for Multi-Agent Communication Structure Generation

Add code
May 11, 2026
Viaarxiv icon

TexEditor: Structure-Preserving Text-Driven Texture Editing

Add code
Mar 19, 2026
Viaarxiv icon

Selective Noise Suppression and Discriminative Mutual Interaction for Robust Audio-Visual Segmentation

Add code
Mar 15, 2026
Viaarxiv icon